Entity Matching in Vector Spatial Data
نویسنده
چکیده
Entity matching is a crucial and hard technology in application of vector spatial data integrating, data updating and map differential analyses. According to the disadvantage of matching algorithms nowadays, from the candidate searching algorithm of entity matching, similarity measure and matching strategy, this paper does a deep research in this three aspects. And proposes an area entity searching algorithm based on interior intersection, and another line entity searching algorithm based on buffer division. Both could enhance the seed of searching candidate match set, decreases the number of candidate matching entities. Also in this paper, giving an area entity integrated similarity measure index integrated entity area, overlap area, barycentre distance etc, and a line entity integrated similarity measure index based on buffer overlap area, azimuth code, length etc. . Both of them improve the recognizing ability of identical entity. Through the match strategy of bidirectional matching and clustering combination, effectively achieves the entity matching of one-to-many and many-to-many. ∗ Corresponding author: FU Zhongliang, Email: [email protected]
منابع مشابه
Adaptive Approximate Record Matching
Typographical data entry errors and incomplete documents, produce imperfect records in real world databases. These errors generate distinct records which belong to the same entity. The aim of Approximate Record Matching is to find multiple records which belong to an entity. In this paper, an algorithm for Approximate Record Matching is proposed that can be adapted automatically with input error...
متن کاملA Multi-Scale Residential Areas Matching Method Using Relevance Vector Machine and Active Learning
Multi-scale object matching is the key technology for upgrading feature cascade and integrating multi-source spatial data. Considering the distinctiveness of data at different scales, the present study selects residential areas in a multi-scale database as research objects and focuses on characteristic similarities. This study adopts the method of merging with no simplification, clarifies all t...
متن کاملCommon Spatial Patterns Feature Extraction and Support Vector Machine Classification for Motor Imagery with the SecondBrain
Recently, a large set of electroencephalography (EEG) data is being generated by several high-quality labs worldwide and is free to be used by all researchers in the world. On the other hand, many neuroscience researchers need these data to study different neural disorders for better diagnosis and evaluating the treatment. However, some format adaptation and pre-processing are necessary before ...
متن کاملPABench: Designing a Taxonomy and Implementing a Benchmark for Spatial Entity Matching
The tremendous increase of data sources containing spatial information is bound up with the diversity of geospatial applications such as location-based services (LBS) and global positioning systems. LBS providers use maps to locate spatial entities referring to points of interest (POI), for instance restaurants or locations of events. In our study, we specifically focus on places that tourists ...
متن کاملMatching of Polygon Objects by Optimizing Geometric Criteria
Despite the semantic criteria, geometric criteria have different performances on polygon feature matching in different vector datasets. By using these criteria for measuring the similarity of two polygons in all matchings, the same results would not have been obtained. To achieve the best matching results, the determination of optimal geometric criteria for each dataset is considered necessary....
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2008